mmgenome: a toolbox for reproducible genome extraction from metagenomes
نویسندگان
چکیده
Summary: Recovery of population genomes is becoming a standard analysis in metagenomics and a multitude of different approaches exists. However, the workflows are complex, requiring data generation, binning, validation and finishing to generate high quality population genome bins. In addition, several different approaches are often used on the same dataset as the optimal strategy to extract a specific population genome varies. Here we introduce mmgenome: a toolbox for reproducible genome extraction from metagenomes. At the core of mmgenome is an R package that facilitates effortless integration of different binning strategies by collecting information on scaffolds. Genome binning is facilitated through integrated tools that support effortless visualizations, validation and calculation of key statistics. Full reproducibility and transparency is obtained through Rmarkdown, whereby every step can be recreated. Availability and implementation: The binning framework of mmgenome is implemented in R. Wrapper scripts for data generation and finishing is written in Perl. The mmgenome toolbox and associated step-by-step guides are available at http://madsalbertsen.github.io/mmgenome/. Contact: [email protected] Supplementary information: Supplementary data are available at Bioinformatics online.
منابع مشابه
De novo extraction of microbial strains from metagenomes reveals intra-species niche partitioning
Background We introduce DESMAN for De novo Extraction of Strains from MetAgeNomes. Metagenome sequencing generates short reads from throughout the genomes of a microbial community. Increasingly large, multi-sample metagenomes, stratified in space and time are being generated from communities with thousands of species. Repeats result in fragmentary co-assemblies with potentially millions of cont...
متن کاملLinking pangenomes and metagenomes: the Prochlorococcus metapangenome
Pangenomes offer detailed characterizations of core and accessory genes found in a set of closely related microbial genomes, generally by clustering genes based on sequence homology. In comparison, metagenomes facilitate highly resolved investigations of the relative distribution of microbial genomes and individual genes across environments through read recruitment analyses. Combining these com...
متن کاملStrain/species identification in metagenomes using genome-specific markers
Shotgun metagenome sequencing has become a fast, cheap and high-throughput technology for characterizing microbial communities in complex environments and human body sites. However, accurate identification of microorganisms at the strain/species level remains extremely challenging. We present a novel k-mer-based approach, termed GSMer, that identifies genome-specific markers (GSMs) from current...
متن کاملmetaMicrobesOnline: phylogenomic analysis of microbial communities
The metaMicrobesOnline database (freely available at http://meta.MicrobesOnline.org) offers phylogenetic analysis of genes from microbial genomes and metagenomes. Gene trees are constructed for canonical gene families such as COG and Pfam. Such gene trees allow for rapid homologue analysis and subfamily comparison of genes from multiple metagenomes and comparisons with genes from microbial isol...
متن کاملMetaproteomics: Evaluation of protein extraction from activated sludge.
Metaproteomic studies of full-scale activated sludge systems require reproducible protein extraction methods. A systematic evaluation of three different extractions protocols, each in combination with three different methods of cell lysis, and a commercial kit were evaluated. Criteria used for comparison of each method included the extracted protein concentration and the number of identified pr...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016